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Abstract: The study objective was to evaluate the usefulness of Classification and Regression Trees (CART), to classify 
clinical responders to antidepressant and placebo treatment, utilizing symptom severity and quantitative EEG (QEEG) 
data. Patients included 51 adults with unipolar depression who completed treatment trials using either fluoxetine, 
venlafaxine or placebo. Hamilton Depression Rating Scale (HAM-D) and single electrodes data were recorded at baseline, 
2, 7, 14, 28 and 56 days. Patients were classified as medication and placebo responders or non-responders. CART analysis 
of HAM-D scores showed that patients with HAM-D scores lower than 13 by day 7 were more likely to be treatment 
responders to fluoxetine or venlafaxine compared to non-responders (p=0.001). Youden's index y revealed that CART 
models using QEEG measures were more accurate than HAM-D-based models. For patients given fluoxetine, patients 
with a decrease at day 2 in 6 cordance at AF2 were classified by CART as treatment responders (p=0.02). For those 
receiving venlafaxine, CART identified a decrease in 8 absolute power at day 7 at the P02 region as characterizing 
treatment responders (p=0.01). Using all patients receiving medication, CART identified a decrease in 5 absolute power at 
day 2 in the FPl region as characteristic of nonresponse to medication (p=0.003). Optimal trees from the QEEG CART 
analysis primarily utilized cordance values, but also incorporated some 8 absolute power values. The results of our study 
suggest that CART may be a useful method for identifying potential outcome predictors in the treatment of major 
depression. 
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INTRODUCTION 

Clinicians long have observed a lag time between 
initiating antidepressant treatment and clinical response [1, 
2]. Some individuals do exhibit early symptomatic 
improvement, and this has been reported to predict further 
iinprovement over the next several weeks [3]. Reports have 
suggested that some physiologic changes are seen shortly 
after initiation of treatment [4-6], including studies 
employing a quantitative EEG (QEEG) ineasure, cordance. 
Among QEEG ineasures, cordance is more strongly 
correlated with regional cerebral perfusion than other 
measures [7] and thus can be interpreted in the context of 
neuroimaging studies of brain function during treatment for 
depression. In both open-label case series [8, 9] and with a 
double-blind placebo-controlled clinical trial paradigm [10], 
early changes in prefrontal values of 9-band cordance were 
significantly related to later clinical outcome. There would 
be great clinical use for features — physiologic, symptomatic, 
or otherwise — that could predict outcome as close as 
possible to the initiation of treatment; we sought to 
understand how physiologic biomarkers and clinical data 
could be examined with innovative methods to refine the 
testing of potential predictors. 
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Because of the large number of possible predictors in the 
dataset, we used Classification and Regression Trees 
(CART) [11] to identify parameters that are predictive of 
treatment response. CART is a nonparametric statistical 
method that creates binary decision trees for solving 
classification problems. We compared the results of CART 
analysis with individual item, sub-scale score and total score 
on the Hamilton Depression Rating Scale (HAM-D), with 
CART analysis of QEEG absolute power, relative power, 
and cordance data, at baseline, 48 hours, 7 days, and at 2 
weeks. In an exploratory analysis, we sought to determine 
whether specific HAM-D items, subscales, or total score, 
provided for the best CART model of treatment or placebo 
response, and also to determine which specific electrode 
locations are the best predictors of treatment response. 
Finally, we sought to compare CART analysis results for 
drug-mediated vs placebo ("nonspecific") response. 

METHODS 

Study Patients 

We studied adults diagnosed with a major depressive 
episode (MDE) who were patients in one of two separate, 
double-blind, randomized treatment trials conducted at the 
UCLA Department of Psychiatry and Biobehavioral 
Sciences over a 24 month period: one group of patients 
received fluoxetine or placebo, while patients in the other 
group received venlafaxine or placebo. All patients were free 
of psychotropic medication for at least 2 weeks prior to 
enrollment. Patients were outpatients with unipolar MDE, 
with diagnoses determined using a structured interview for 
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DSM-IV [12], and with scores >16 on the 17-item Hamihon 
Depression Rating Scale (HAM-D) at intake. No additional 
psychotropic medications were permitted during the trials 
(e.g., no sedative or hypnotic medications). Recruitment 
mechanisms as well as inclusion and exclusion criteria were 
identical for both protocols (Table 1). In accordance with 
principles of the Helsinki Declaration of 1975, both 
protocols were approved by the UCLA Institutional Review 
Board, and informed consent to participate in this research 
was obtained from all patients. Patient confidentiality was 
maintained for all patients. 

A total of 5 1 patients completed the protocols and were 
examined for the present report (24 from the fluoxetine vs 
placebo trial, and 27 from the venlafaxine vs placebo trial), 
and had been the subject of our previous report on early 
physiologic changes [10]. Groups (Placebo, Fluoxetine, and 
Venlafaxine) were comparable at pretreatment baseline on 
demographic and clinical parameters. We defined clinical 
response as reduction in final depression severity to a HAM- 
D score of <10 points. The two studies had similar drop-out 
rates, as well as response rates to medication and placebo. 
Because of the high degree of comparability of the patients 
from two trials, data could be pooled. 

Experimental Procedures 

Fluoxetine and Venlafaxine Treatment Trial Designs 

The nine-week protocol employed two phases. Patients 
first entered a one- week, single-blind placebo lead-in phase. 
Patients who failed to meet study entry criteria at the end of 
lead-in (e.g., because of intolerable side effects to placebo, or 
a strong placebo response (no longer meeting inclusion 
criteria)) were removed from the protocol and were referred 
for open-label treatment. Patients eligible to continue in the 
protocol entered a double-blind phase and were randomized 
to receive 8 weeks of either placebo or active medication 
(fluoxetine 20 mg po QD in the first protocol or venlafaxine 
150 mg po QD in the second) dispensed in identical 
capsules. Patients receiving fluoxetine were given 20 mg/d 
and continued at that dose for the 8 weeks; patients receiving 
venlafaxine began at 37.5 mg/d, increased over a week to 
150 mg/d, and then continued at that dose for the remaining 
7 weeks. To preserve blinding, placebo "dose" was escalated 
in that trial. 



All patients received brief sessions of supportive 
psychotherapy during the blinded phase of the study, in order 
to address safety concerns about dispensing placebo alone to 
patients with significant depression (15-25 minutes of 
unstructured counseling and assistance in problem solving 
by a research nurse at the follow-up visits). Follow-up visits 
for symptom/side effect monitoring and for the supportive 
therapy took place at two days and at weekly intervals 
thereafter after the start of the double-blind phase of the 
study. Symptoms were monitored with a focused clinical 
interview, clinician rating scales (e.g., HAM-D), and self- 
rating scales. At the end of the 8 week double-blind phase, 
the blinding was broken and HAM-D scores were used to 
categorize patients as responders or non-responders. 

QEEG Data Acquisition 

QEEG recordings were obtained (a) at pretreatment 
baseline prior to randomization, (b) at 48 hours (after two 
doses of drug or placebo), and (c) after 1 week on 
medication or placebo, as shown in Fig. (1). Recordings 
were made with the QND System (Neurodata, Inc., 
Pasadena, CA), using procedures employed in our previous 
reports and summarized here. Patients were instructed to rest 
in the cycs-closed, maximally alert state, in a quiet room 
with subdued lighting. The technicians monitored the QEEG 
data in real-time during the recording, and re-alerted the 
patients every 30-45 seconds as needed to avoid drowsiness. 
Electrodes were placed with an electrode cap (ElectroCap, 
Eaton, OH) using 35 recording electrodes distributed across 
the head according to the International 10-20 System 
arrangement (Fig. 2). Data were collected using a Pz referen- 
tial montage and were digitized at 256 samplcs/channel/sec 
by the QND system (bandpass filtered 0.3 - 70 Hz). 

Each EEG recording was reviewed by a technician who 
was blinded to the identity of the patient, treatment 
condition, and clinical status; the first 20-32 seconds of 
artifact-free data were selected to be processed. A second 
technician reviewed these selections for accuracy. These 
selections were then processed using a fast Fourier transform 
to obtain absolute and relative power values in four 
frequency bands (0.5-4 Hz, 4-8 Hz, 8-12 Hz, and 12-20 Hz). 
"Absolute power" describes the amount of power in a 
frequency band at a given electrode (measured in ^iV^), and 
"relative power" is the percentage (%) of power contained in 



Table 1. Inclusion and Exclusion Criteria 



Inclusion 


1) current major depressive episode 

2) symptom severity score of 17 or more on tlie 17-item liamilton Depression Rating Scale (HAM-D), with item #1 > 2. 


Exclusion 


1) Bipolar Disorder 

2) any psychotic symptoms (e.g. depression with psychotic features or an Axis I psychotic illness) 

3) dementia 

4) delirium 

5) substance-related disorders, 

6) eating disorders, 

7) cluster A or B Axis 11 disorders 

8) treatment with electroconvulsive therapy (ECT) in the prior six months 

9) any past history of craniotomy, skull fracture, seizures, or significant neurological ilhiess 

10) past history of suicidal intent, plan, or attempt. The development of active suicidal ideation during the study was a criterion for 
unblinding and initiating open-label freatment. 



Identical criteria were employed in both the fluoxetine and venlafaxine trial groups [10]. 
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Fig. (1). Experimental Protocol Timeline. Subjects were assessed and enrolled at "intake", had the pretreatment baseline EEG recorded at 
that time, and then participated in a one week, single-blind placebo lead-in phase. Randomization to treatment modality (active medication or 
placebo) took place at the time marked "start of treatment". Another EEG was recorded after 48 hours of treatment and again at 1 week. 
Clinical assessment to determine outcome (responder vs non-responder) took place after 8 weeks of treatment (9 weeks in study altogether). 
Subjects were monitored weekly during the double-blind treatment (arrows not shown) for clinical changes and adverse reactions [10]. 



a frequency band, relative to the total power across the entire 
spectrum (0.5-20 Hz) computed separately for each 
electrode. The QEEG data were reformatted offline to 
compute linked-ear-reference absolute and relative power 
values; these values were used in the regional ineasure 
analyses described below. QEEG data were also reformatted 
to bipolar channel pairs and processed further to yield 
cordance values (below). 

Cordance Calculations 

Cordance was calculated by combining conventional 
QEEG absolute and relative power measures in a common 
metric, and was computed in three steps using methods we 



have detailed previously [13, 14] and described briefly here. 
First, EEG power values were computed using a re- 
attributional electrode montage (Fig. 2) because that 
montage afforded the highest correlation between EEG 
measures and PET measures of regional cerebral blood flow. 
Second, these values were normalized across all electrode 
sites using a z-transformation, yielding Anorm(s,f) & 
Rnorm(s,f) for all sites s and frequency bands f. Third, 
cordance values (Z) were formed as the sum of Anorm and 
Rnorm. 

Z(s,f) = Anorm(s,f) + Rnorm(s,f) 




Fig. (2). Electrode montage. The 35 scalp electrodes from the extended International 10-20 system. Line segments denote bipolar channels 
used in the reattributional montage. Electrodes included in the calculation of average prefrontal cordance are Fpl, Fp2, and Fpz [14]. 
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Data Analysis and Statistical Methods 
Software 

Software utilized for statistical analysis included SAS 
version 8 (SAS Institute, Cary, NC), and for CART analysis, 
Salford Systems' CART version 5.0 (Salford Systems, San 
Diego CA). 

Patient Categorization 

Patients were categorized both by treatment (fluoxetine, 
venlafaxrne or placebo) and by outcome (response/non- 
response). A final 17-item HAM-D score of <10 at the end 
of the 8 week double-blind phase was used to define the 
responders [10, 15]. As noted earlier, patients from the 
fluoxetine and venlafaxine trials did not differ significantly 
in response rates or demographic parameters and thus were 
pooled for these analyses. Outcomes were evaluated using 
the entire HAM-D scores, and also using two frequently- 
employed unidimensional subscales derived from the full 
HAM-D: the Bech Melancholia subscale [16] (Bech et ai, 
1975) and Maier-Philipp Severity subscale [17]. These were 
considered because, in a meta-analysis by Farics et al. [18], 
they had shown greater size effects and applicability to 
smaller sample sizes than the full 17-item HAM-D. Both 
subscales include items 1 (depressed mood), 2 (feelings of 
guilt), 7 (work and activities), 8 (psychomotor retardation) 
and 10 (anxiety/psychic). The Bech subscale also includes 
item 13 (somatic symptoms/general), while the Maier- 
Philipp subscale includes item 9 (agitation). 

Electrode and Regional Analyses 

Our primary analyses with QEEG measures employed 
data from each of the 35 individual electrodes. Based on 
prior significant findings for the prefrontal region, we also 
examined the prefrontal region by averaging the individual 
values from the three prefrontal channels (FPl, FP2, FPZ) as 
had been done in those earlier reports [9, 10]. 

CART Method 

Data were analyzed using classification and regression 
tree analysis (CART) [11]. This is a novel application of 
CART analysis to clinical and physiologic data in mood 
disorders, and so merits more extensive description. CART 
is a nonparametric statistical methodology that creates binary 
decision trees: the methodology recursively partitions the 
parameter space to classify patients by category with the 
fewest mistakes (termed purity). The binary decision tree 
contains nodes connected by branches. The root node 
contains all patients. A binary decision tree is created by 
partitioning a node into two daughter nodes. The objective of 
the analysis is to find partitions of the data such that the end 
nodes (terminal nodes) are as homogeneous as possible. The 
quantitative measure of node homogeneity is termed the 
impurity function. The simplest idealization of the impurity 
fimction is the number of patients who meet an objective 
criteria divided by the total number of patients in the node. 
Ratios close to 0 or 1 are considered more pure. 

To partition a node, CART examines all possible spUts of 

the explanatory variables. In general, the number of possible 
splits for ordinal or continuous variables is 1 less the number 
of distinctly observed values. A potential split is judged by 
its reduction of the impurity function for both daughter 
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nodes it creates. The partitioning iteratively continues by 
splitting both of the daughter nodes into 2 daughter nodes 
and continues until the tree is saturated, that is, until no 
further partitions can be performed. 

Although the tree grown may fit a given dataset well, it 
may be overfit to that sample and perform poorly with a 
different sample. The optimal number of terminal nodes can 
be determined by cost-complexity pruning [11]. Cost 
complexity is defined as Ros(T) = R(T) + a|T|, where a is the 
cost-complexity parameter, R(T) is the misclassification 
error and |T| is the number of terminal nodes in tree T. 
Brieman has shown that for any value of a, there is an 
optimal tree that minimizes the cost-complexity. To find the 
optimal tree, a cross-validation approach can be used. The 
data set is split into V subsets and a tree is constructed from 
V-1 subsets and validated on the sample that was left out. 
The procedure is repeated V times producing V trees. The 
average of the prediction errors across all trees of a certain 
complexity is used as the final prediction error. The optimal 
tree is selected as the minimum size tree among those trees 
whose cross-validation error is within 1 standard deviation of 
the minimum cross-validation error. 

An advantage of CART is its handling of missing data 
through surrogate variables. For every split, CART 
examines the primary splitter as well as other variable splits 
that minimize impurity. A surrogate split attempts to mimic 
the result of the primary split. Another advantage of CART 
is its handling of variable interactions. The binary tree 
structure shows the effects of variable interactions at the 
optimal splits. Logistic regression is a special case of CART. 
However with a large number of parameters it is 
computationally infeasible to examine every possible subset 
along with their interactions with logistic regression. 
cart's utility is that its algorithm does exactly that and 
determines the combination of variables that best increase its 
predictive power. 

Sensitivity, specificity, and positive- and negative- 
predictive values were calculated using entire trees. We also 
calculated the likelihood ratio of a negative test which is 
defined as (1 -specificity)/ sensitivity, the likelihood ratio of a 
positive test which is defined as sensitivity/ (1 -specificity), 
and Youden's index y which is defined as sensitivity + 
specificity -1, all of which are summary measures of both 
sensitivity and specificity [19]. Models were compared using 
Youden's index where higher values of y are desirable. 
Single node trees were assessed using Fisher's exact test. 

Treatment response served as the primary outcome 
variable used for the analyses. Potential predictors of 
treatment response were HAM-D individual items, Bech and 
Maier-Philipps subscales, and total score at baseline, 2 and 7 
days, QEEG scores at baseline, 2 and 7 days. Models were 
compared using Youden's index y. 

RESULTS 

Clinical Response 

Thirteen of 25 patients (52%) responded to medication: 
seven of 13 patients (54%) responded to treatment with 
fluoxetine, and six of 12 (50%) responded to venlafaxine. 
Response rates to medication between the two studies were 
not statistically different. The placebo response rate was 
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38% (10 of 26). A majority of patients in both responder 
groups had sustained decreases in depression rating scores 
(7/13 M-R and 7/10 P-R). Clinical differences between 
groups did not emerge until 4 weeks of treatment. 
Responders were significantly different from non-responders 
on final HAM-D (F(i,49)=121.4, p< 0.0001). Both responder 
groups had comparable clinical outcomes (final HAM-D 6.0 
± 3.1 for M-R and 6.1 ± 3.4 for P-R); both non-responder 
groups also showed equivalent clinical endpoints (final 
HAM-D 18.4 ± 4.6 for M-NR and 17.9 ± 4.5 for P-NR). 

Logistic Regression Analyses of Clinical Data 

For venlafaxine and fluoxetine combined, medication 
non-responders were more likely to have HAM-D total 
scores greater or equal to 13 compared to treatment 
responders at 1 week of treatment (p=0.001 by Fisher's exact 
test). The sensitivity of this test was 0.73 with 100% 
specificity. The positive predictive value was 1 and the 
negative predictive value was 0.786. Youdcn's index y was 
0.73. The likelihood ratio of a negative test was 0.27. Total 
HAM-D score at week 1 was also significant by logistic 
regression OR=0.762 (CI 0.606, 0.959), p=0.02. We also 
assessed individual HAM-D scores at baseline in terms of 
treatment response, and found no significant association 
between initial HAM-D scores and outcomes. For patients 
receiving placebo, those with scores less than 2 on item 6 of 
the HAM-D scale (late insomnia) at baseline were more 
likely to be placebo responders (p=0.05). The sensitivity of 
this test was 100% and the specificity was 0.47. The positive 
predictive value was 0.47, negative predictive value 1. The 
likelihood ratio of positive test was 1.89 and the likelihood 
ratio of a negative test was 0. Youden's index y was 0.47 
(Table 2). 

CART Analysis of QEEG Data 

QEEG change scores were calculated as change in QEEG 
scores from baseline to time of recording (day 2, day 7, day 
14, day 28 or day 56). Purity was measured using the 
Entropy criteria and 10-fold cross validation was used to 
assess the stability of the trees. The data were also analyzed 
by logistic regression. 

Using the combined sample of fluoxetine and 
venlafaxine, patients with a change from baseline to day 2 in 
8 absolute power at FPl < -0.371 were less likely to be 
treatment responders (P=0.0034). Looking further down the 
classification tree, medication responders were more likely 
to have a change fi-om baseline to day 2 in 8 absolute power 



at FPl > -0.371, and have change from baseline to 7 day for 

9 cordance at AF2 < 0.862. Overall, 11 of 13 treatment 
responders and 12 of 12 treatment non-responders were 
correctly classified. The sensitivity of this test was 0.85 and 
the specificity was 100%. The positive predictive value was 
1 and the negative predictive value was 0.86. The likelihood 
ratio of a negative test was 0.15 and y was 0.85. The CART 
results are given in Fig. (3). 

Each medication was then considered separately. For 

fluoxetine, patients with a change from baseline to day 2 in 9 
cordance at AF2 < -0.336 were more likely to be treatment 
responders than patients with higher changes (p=0.021 by 
Fisher's exact test). Five of 7 treatment responders and 6 of 6 
treatment non-responders were correctly classified. The 
sensitivity of this test was 0.71 and the specificity was 100%. 
The positive predictive value was 1 and the negative 
predictive value was 0.75. The likelihood ratio of a negative 
test was 0.29 and y was 0.71. 

For venlafaxine, patients with changes from baseline to 
day 7 in 8 absolute power at P02 < -0.557 were more likely 
to be treatment responders (p=0.015 by Fishers exact test). 
Five of 6 treatment responders and 6 of 6 treatment non- 
responders were correctly classified. The sensitivity of this 
test was 0.83 and the specificity was 100%. The positive 
predictive value was 1 and the negative predictive value was 
0.86. The likelihood ratio of a negative test was 0.17 and y 
was 0.83. The CART results are given in Fig. (3). Placebo 
non-responders were more likely to have 9 cordance changes 
from baseline to day 7 at T4 < 0.202 (p=0.004). Twelve of 
1 6 placebo non-responders were correctly classified and 9 of 

10 placebo responders were correctly classified. The 
sensitivity of this test was 0.90 and the specificity was 0.75. 
The positive predictive value was 0.69 and the negative 
predictive value was 0.92. The likelihood ratio of a negative 
test was 0.13, the likelihood ratio for a positive test was 3.6 
andy was 0.65. 

When the average of absolute power, relative power, or 
cordance values from the 3 individual prefrontal leads were 
used to examine the prefrontal region as had been done 
previously [10], the CART method identified no significant 
relationship between these variables and clinical outcome. 

DISCUSSION 

Three central findings emerged from this preliminary 
study. First, the CART method was able to identify clinical 
features during treatment that were significantly related to 
outcome. Second, CART confirmed that the use of prefi^ontal 



Table 2. Summary of Statistical Measures 



Model 


Sensitivity 


Specificity 


PPV 


NPV 


■y 


HAM-D CART-F and V 


0.73 


1 


1 


0.786 


0.73 


HAM-D CART-Pbo 


1 


0.47 


0.47 


1 


0.47 


QEEG CART- F and V 


0.85 


1 


1 


0.86 


0.85 


QEEG CART-F 


0.71 


1 


1 


0.75 


0.71 


QEEG CART-V 


0.83 


1 


1 


0.86 


0.83 


QEEG CART-Pbo 


0.9 


0.75 


0.69 


0.92 


0.65 



F = fluoxetine, V = venlafaxine, Pbo = placebo, PPV = positive predictive value, NPV = negative predictive value, Y= Youden's index y. 
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Fig. (3). QEEG CART tree of fluoxetine and venlafaxine. A2_DA_FP1 
A7_TZ_AF2 - the change from basehne to day 7 for G Z score at AF2. 



the change from basehne to day 2 for 5 absolute score at FPl. 



physiologic data is useful in predicting clinical outcome, 
though with refinement in the regional specificity compared 
with prior work. Third, response to placebo was associated 
with changes in temporal activity, a new observation. We 
believe this report is the first use of CART to examine 
clinical and physiologic features of response to 
antidepressant medications, and find that this method merits 
further use in psychiatric research. 

These results are based on a small sample of patients and 
further studies are necessary to confirm the findings. One 
should also note that CART also differs from traditional 
statistical tests in that the objective function for CART is to 
maximize purity in the groups and not to minimize the error 
in fitting a functional form. CART also recursively examines 
every variable as well as the optimal split point for the 
variable. The hierarchical nature of the tree also yields 
information about interactions. Owing to the recursive nature 
of CART, it is not amenable to traditional notions of p- 
values. 

Applying CART to HAM-D score data, we found that 
patients taking fluoxetine and venlafaxine were more likely 
to be treatment responders if the total HAM-D score was less 
than 13 at 1 week of treatment. We also found that patients 
who were randomized to placebo and had a score less than 2 
on item 6 of the HAM-D were more likely to respond to 
placebo. When we used CART to examine the QEEG data 
we were able to obtain models that had higher accuracy for 
both treated and placebo patients. 



All of the QEEG data, CART models selected cordance 
and absolute power measures. Of interest, relative power 
values were never chosen as optimal predictors. However 
this is a relatively small sample size, so caution must be used 
in any generalizations. 

Cook et al. [10] found that medication responders 
uniquely showed significant decreases in prefrontal cordance 
at 48 hours and 1 week, and that no other brain regions had 
significant changes. In this study, CART analysis found 
significant results utilizing single point electrode data within 
the prefrontal region examined in that prior report. For the 
optimal CART tree investigating treatment responders versus 
non-responders, for the initial CART split value, 12 out of 13 
treatment responders had change from baseline to day 2 8 
absolute power at FPl > -0.371, while 8 out of 12 treatment 
responders had scores < -0.371. 

In the optimal categorization tree identified by CART, 9 
cordance and 6 absolute power were the only QEEG 
parameters that were included. Cordance values may be 
interpreted as reflecting regional cortical perfusion, so that a 
lower cordance value may reflect less regional activity in the 
area recorded by the electrode, while a higher value may 
reflect increased activity. For the optimal CART tree 
investigating treatment responders versus non-responders for 
both medications combined, 11 of 12 treatment responders 
had a lesser increase, or greater decrease, in cordance from 
baseline to day 7 at AF2 than in the treatment non- 
responders. 
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For fluoxetine we found that patients with a change in 
cordance from baseline to day 2 at AF2 < -0.336 were more 
likely to be treatment responders than patients with higher 
changes: fluoxetine treatment responders may have had a 
greater decrease in cortical perfusion, and neuronal activity, 
in the area recorded by the AF2 electrode. 

Placebo non-responders were more likely to have 
cordance changes from baseline to day 7 at T4 < 0.202: 
placebo responders were more likely to have a greater 
increase in cortical perfusion, and neuronal activity, in the 
area recorded by the T4 electrode. 

These findings are consistent with findings from other 
researchers showing a decrease in prefrontal cortex 
metabolism with successful treatment of major depressive 
disorder with paroxetine [20]. In addition, the CART 
findings regarding placebo responders may provide 
complementary information about placebo treatment to the 
findings in a previous study from our lab with the same 
patients that showed cordance changes suggestive of 
increased prefrontal cortical perftision in placebo responders 
[21]. 

The different electrodes selected for optimal CART 
analysis trees for the different medications may reflect the 
effect of the specific medications on specific areas of the 
brain (and involved pathways), and in relationship to 9 
cordance changes, may reflect changes in cortical perfusion 
induced by the specific medications that have relevance for 
understanding the mechanisms underlying treatment 
response (vs non-response). 

Of interest, out of 35 electrodes providing data, 4 
electrodes provided data selected for optimal CART trees, 
and all but 1 (FPl) were from outside the prefrontal region. 
However, data from FPl was used as the primary split in the 
optimal CART tree for fluoxetine plus venlafaxine treatment 
responders vs non-responders. 

Prior work with cordance [9, 10, 21] had identified 
changes in the prefrontal region as biomarkers of treatment 
response; those studies tested regional hypotheses and 
explicitly did not consider the 35 individual electrodes in 
order to avoid Type II statistical errors. In our present 
dataset, only prefrontal data from FPl was included in the 
optimal CART trees, along with AF2, P02 and T4 
electrodes. These findings suggest that, in addition to 
considering regions defined in classical neuroanatomical 
frameworks, future studies should test hypotheses about 
ensembles of electrodes that may be linked by distributed 
neuronal networks [22]. 

While significance was noted with HAM-D CART 
analysis, the results could be viewed as reflecting a lesser 
likelihood of treatment response if patients start with a 
higher HAM-D score. Optimal QEEG CART analysis in this 
study found most of the significant electrode points in areas 
other than the prefrontal cortex, and calculated values for the 
entire prefi"ontal region were not found to be significant. 
These results are not in conflict with the results of a prior 
study with the same data that only noted significance of the 
prefrontal cortex when regions of the brain were analyzed 
[10], and suggest activity at other locations in the brain, such 
as from the anterior frontal region, may add useful 
information for predicting active treatment or placebo 



response. The application of CART methodology was found 
to be a useful exploratory tool in investigating QEEG data 
obtained before and during antidepressant and placebo 
treatment of unipolar depression. These preliminary results 
need to be replicated with studies with larger patient 
numbers. 

ABBREVIATIONS AND TECHNICAL TERMS 



Absolute power 
AF2 region 

Alpha-band 
Beta-band 
Clinical responder 

Cordance 



6 (delta) band 
FPl region 

Negative Predictive Value = 
P02 region 

Positive Predictive Value = 
QEEG 

Relative power 



The amount of power in a 
frequency band at a given 
electrode (measured in nV^) 

The part of the anterior 
frontal brain region that is 
recorded by the AF 1 
electrode (see Fig. 2) 

EEG data in the frequency 
band from 8 up to 12 Hz 

EEG data in the frequency 
band fi-om 12 up to 20 Hz 

A final 17-item HAM-D 
score of <= 10 at the end of 
the 8 week double-blind 

phase 

A value calculated by 
combining conventional 
QEEG absolute and relative 
power measures in a common 
metric 

EEG data in the frequency 
band from 0.5 up to 4 Hz 

The part of the prefrontal 
brain region that is recorded 
by the FPl electrode (see Fig. 
2). 

The proportion of patients 
who are predicted to be non- 
responders who are actually 
non-responders 

The part of the parieto- 
occipital brain region that is 
recorded by the P02 
electrode (see Fig. 2) 

The proportion of patients 
who are predicted to be a 
treatment responder that are 
actually treatment responders 

Quantitative EEG, which 
involves computer signal 
processing of EEG data 

The percentage (%) of power 
contained in a frequency 
band, relative to the total 
power across the entire 
spectrum (0.5-20 Hz) 
computed separately for each 
electrode 
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Sensitivity 



Specificity 



e (theta) band 
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